Unicode-8 based linguistics data set of annotated Sindhi text

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of Unicode based Sindhi Typing System

This paper presents a first attempt in designing and development of Unicode based Sindhi Typing System for the Sindhi speaking community. The Sindhi Typing project is developed in order to improve the typing speed of Sindhi computing professionals as no such system currently exists. It is Platform independent application requiring no third party plugin or any regional languages support. No Sind...

متن کامل

Representing Multimodal Linguistics Annotated data

The question of interoperability for linguistic annotated resources requires to cover different aspects. First, it requires a representation framework making it possible to compare, and potentially merge, different annotation schema. In this paper, a general description level representing the multimodal linguistic annotations is proposed. It focuses on time and data content representation: This...

متن کامل

Towards a Generic Framework for the Development of Unicode Based Digital Sindhi Dictionaries

Dictionaries are essence of any language providing vital linguistic recourse for the language learners, researchers and scholars. This paper focuses on the methodology and techniques used in developing software architecture for a UBSESD (Unicode Based Sindhi to English and English to Sindhi Dictionary). The proposed system provides an accurate solution for construction and representation of Uni...

متن کامل

Sentiment Summerization and Analysis of Sindhi Text

Text corpus is important for assessment of language features and variation analysis. Machine learning techniques identify the language terms, features, text structures and sentiment from linguistic corpus. Sindhi language is one of the oldest languages of the world having proper script and complete grammar. Sindhi is remained less resourced language computationally even in this digital era. Vie...

متن کامل

Text Based Interactive Fiction and Computational Linguistics

Interactive ction (IF) or text adventures are text-based computer games. After a short introduction I will explain some details of an IF authoring system. As an illustration I will give an outline of my own project. I will conclude by discussing some scientiic attempts to improve the genre using theories of artiicial intelligence and computational linguistics. 1 What is interactive ction (IF) ?...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data in Brief

سال: 2018

ISSN: 2352-3409

DOI: 10.1016/j.dib.2018.05.062